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PRELIMINARY AMENDMENT AND 
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Please amend the above-identified application before examination, as set forth below. 



IN THE SPECIFICATION AND ABSTRACT ; 

In accordance with 37 C.F.R. § 1.121(b)(3), a Substitute Specification (including the 
Abstract, but without claims) accompanies this response. It is respectfully requested that the 
Substitute Specification (including Abstract) be entered to replace the Specification of record. 



IN THE CLAIMS : 

On the first page of the claims, first line, change "What is claimed is:" to: 
- What Is Claimed Is :-. 



Please cancel original claims 1 to 15, without prejudice, in the underlying PCT 
AppHcation No. PCT/DE99/02770, and cancel substitute claims 1-15, without prejudice. 



Please add the following new claims: 



16. (New) A method for integrating audiovisual encoded information into a predefined 
frame-structured transmission standard, comprising the steps of: 

preprocessing the audiovisual encoded information into separate data streams; 

respectively supplying the audiovisual encoded information as the separate data streams; 

multiplexing the separate data streams into at least one data channel in the predefined 
fi-ame-structured transmission standard; 

exchanging capabilities of terminals that communicate with one another as to types of 
data capable of being one of encoded and decoded and as to encoding and decoding operations 
that are supported, after a connection is established; and 

signaling, in accordance with data structures of an encoding standard, specifications on 
the data type used, a decoding tool to be used, and encoding parameters including a data 
capacity. 

17. (New) The method according to claim 16, wherein: 

the data structures are selected so as to be identical. 

18. (New) The method according to claim 16, fiirther comprising the steps of: 
for object-based, audiovisually encoded information having a low niraiber of 

objects, causing a multiplexer to packetize a set of the separate data streams; and 

transmitting the packetized set of separate data streams in individual transmission 
channels in accordance with the H.245 standard. 

19. (New) The method according to claim 18, wherein: 

the multiplexer conforms to the H.223 standard, and 

the individual transmission channels include ITU channels. 

20. (New) The method according to claim 18, wherein: 

a data field assigns individual packets within the individual transmission 
channels assigned to the individual packets, and 

a data field in an MPEG-4 transmission contains an identification of 
individual elementary data streams. 
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21 . (New) The method according to claim 18, further comprising the step of: 

taking over directly from the multiplexing step a data format of MPEG-4 output data 
streams in order to avoid a further reformatting. 

22. (New) The method according to claim 18, further comprising the step of: 
accommodating object descriptors for the audiovisual encoded information in a separate 

channel corresponding to a logical channel not equal to 0 in accordance with the ITU-H.223 
standard. 

23. (New) The method according to claim 22, wherein: 

the object descriptors each corresponds to an Initial Object Descriptor in 
accordance with MPEG-4. 

24. (New) The method according to claim 1 8, further comprising the steps of: 

for object-based audiovisual, encoded information having a large number of objects, 
multiplexing the separate data streams into a common data stream; and 

transmitting the separate data streams in one of the individual transmission channels. 

25. (New) The method according to claim 24, wherein: 

the one of the individual transmission channels includes an ITU channel. 

26. (New) The method according to claim 16, wherein: 

the multiplexed separate data streams are in a transmission frame of the 
predefined frame-structured fransmission standard, and 
the method further comprises the step of: 

in addition to the multiplexed separate data streams in the transmission frame 
of the predefined frame-structured transmission standard, storing a signaling information 
that signals that multiplexed information packets of a constant length are fransmitted, on 
whose basis a synchronization can be carried out. 

27. (New) The method according to claim 26, wherein: 

the synchronization is in response to faulty data packets. 



3 



28. (New) The method according to claim 1 6, further comprising the step of: 
accommodating in an additional channel object descriptors. 

29. (New) The method according to claim 28, wherein: 

the additional channel includes logical channel 0 corresponding to the ITU 
H.245 standard. 

30. (New) The method according to claim 28, wherein: 

the object descriptors each corresponds to an Initial Object Descriptor in 
accordance with MPEG-4 for the audiovisual encoded information 

31 . (New) The method according to claim 28, further comprising the step of: 
storing allocation data between the separate data streams in the additional channel. 

32. (New) The method according to claim 31, wherein: 

the separate data streams include SL-packetized MPEG-4 elementary data 
streams and multiplexed data. 

33. (New) The method according to claim 31, wherein: 

the additional channel includes logical channel 0 in accordance with the ITU- 
H.245 standard. 

34. (New) The method according to claim 16, further comprising the step of: 

for a signaling information, providing data fields that identify, on the one hand, a 
constant length and, on the other hand, a packet size of multiplexed information packets. 

35. (New) The method according to claim 16, wherein: 

MPEG-4 data corresponds to the audiovisual encoded information that are 
preprocessed into FlexMux packets of a constant length, and 

the FlexMux packets of the constant length are multiplexed into a transmission 
frame in order to make possible a transmission in accordance with the ITU standard 
H.324. 
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36. (New) The method according to claim 1 6, further comprising the steps of: 

within an Adaptation Layer of a variable length in accordance with the ITU standard 
H.324, storing a plurality of MPEG-4 data in data packets of a constant length; and 

providing a length field in a header portion of each of the data packets, wherein: 
the length field corresponds to a synchronization identification. 

37. (New) The method according to claim 36, wherein: 

the synchronization identification is for synchronizing a receiver. 

38. (New) A terminal for a transmitter-side integrating of audiovisual encoded 
information into a predefined fi-ame-structured transmission standard, comprising: 

an arrangement for preprocessing the audiovisual encoded information into separate data 
streams for the terminal; 

an arrangement for delivering the audiovisual encoded information as the separate data 
streams; 

an arrangement for multiplexing the separate data streams into at least one data channel 
of the predefined fi-ame-structured transmission standard; 

an arrmgement for exchanging capabilities with other terminals as to types of data 
capable of being one of encoded and decoded and as to encoding and decoding operations that 
are supported, after a connection is established; and 

an arrangement for signaling, in accordance with data structures, specifications on the 
data type used, a decoding tool to be used, and encoding parameters including a data capacity. 

39. (New) A terminal for a receiver-side evaluation of audiovisual encoded information 
in a predefined fi-ame-structured transmission standard, comprising: 

an arrangement for decomposing up at least one multiplexed frame-structured data 
channel of a transmission standard into individual audiovisual data streams; 

an arrangement for exchanging capabilities with other terminals as to types of data 
capable of being one of encoded and decoded and as to encoding and decoding operations that 
are supported, after a connection is established; and 

an arrangement for signaling, in accordance with data structures, specifications on the 
data type used, a decoding tool to be used, and a data capacity. 
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Remarks 

This Preliminary Amendment cancels original claims 1 tol5, without prejudice, in 
the underlying PCX Application No. PCT/DE99/02770, and cancels substitute claims 1-15, 
without prejudice. The Preliminary Amendment also adds new claims 16-39. The new claims 
conform the claims to U.S. Patent and Trademark Office rules and do not add new matter to the 
application. 

In accordance with 37 C.F.R. § 1.121(b)(3), the Substitute Specification 
(including the Abstract, but without the claims) contains no new matter. The amendments 
reflected in the Substitute Specification (including Abstract) are to conform the Specification and 
Abstract to U.S. Patent and Trademark Office rules or to correct informalities. As required by 
37 C.F.R. § 1.121(b)(3)(iii) and § 1.125(b)(2), a Marked Up Version Of The Substitute 
Specification comparing the Specification of record and the Substitute Specification also 
accompanies this Preliminary Amendment. Approval and entry of the Substitute Specification 
(including Abstract) are respectfiilly requested. 

The underlying PCT Apphcation No. PCT/DE99/02770 includes an International 
Search Report, dated February 21, 2000, and an International Preliminary Examination Report 
dated December 13, 2000, copies of which are submitted herewith. 

AppUcants assert that the subject matter of the present apphcation is new, non- 
obvious, and usefiil. Prompt consideration and allowance of the apphcation are respectfiilly 
requested. 

Respectfiilly Submitted, 
KENYON& KENYON . 

Dated: ^Hhl By: lUh^^. OpPyv 

Richard L. Mayer 
(Reg. No. 22,490) 

One Broadway 

New York, NY 10004 

(212) 425-7200 
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A METHOD FOR I lAL CODED INFORMATION INTO 

ONE PREDEFINED, J^RAME-blKUUlUKiiu iJ^^ SMISSION STANDARD, AS WELL 
AS TERMINALS FOR THIS PURPOSE 

Field Of The Invention 



For the transmission of image and sound data at low bitrates for multimedia communications, 
on the basis of the ITU-H.324 specification, "Terminals for Low-Bitrate Multimedia 
Communications," a system is specified which is suited for video telephony applications. 



Background Information 

Figure 1 depicts a block diagram of a multimedia system according to the H.324 standard. In 
the block designated by reference numeral 1 are accommodated the modules which are 
defined in greater detail in H.324. Video codec 2 is configured in accordance with the 
method in ITU-H.263/H.261 . In order to smooth out any time differences between the image 
coding and the sound coding, a delay device 4 is connected downstream of audio codec 3 in 
accordance with ITU G.723. Device 5 functions to process data protocols, e.g., V.14 LAPM, 
etc., and device 6 processes control protocols in accordance with ITU H.245. The audiovisual 
data is supplied to codecs 2 and 3 by appropriate I/O (Input/Output) devices 7 and 8. The 
devices for processing protocols 5 and 6 receive their input data via devices 9 (User Data 
Applications) and 10 (System Control). The data streams of codecs 2, 3 and of protocol 
processing devices 5 and 6 are joined via multiplex/demultiplex device 1 1 in accordance with 
the H.223 standard. Modem 12, connected dovmstream, supplies V.34 conformal data for the 
combined data streams and V.25 conformal data for the system control data. Transmission 
network 13 is connected to block 1, along with an appropriate network control system 14. 



Summary Of The Invention 

The method in accordance with the present invention is suitable for integrating information 
that is encoded in an object-based manner, in particular according to the MPEG-4 
transmission standard, into one predefined, frame-structured transmission standard, in 
particular into an ITU standard, and it thus makes possible the transfer of the encoded 
MPEG-4 data. In contrast to conventional video coding methods, such as the video method 
discussed above in accordance with ITU-H.263/H.261 and the audio codec according to 
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G.723.1, the following specific advantages result: 

- object-based coding of synthetic and natural visual objects as well as audio objects, 

- improved coding efficiency, 

- improved visual error-resistance of the video coding, 

- individualized format for describing the arrangement of audiovisual objects, 

- synchronization of different audiovisual objects, 

- interaction with audiovisual objects. 

PCX Publication No. WO 98/21 846A describes multiplexing a multiplicity of substantially 
identical audiovisual data streams into one common intermediate data stream. For the 
intermediate data streams, measures are devised for detecting errors via an appropriate 
signaling in an initiaUzation phase. The intermediate data streams are grouped into one 
overall data stream. 

The publication by Lindbergh D., entitled "The H.324 Multimedia Communication 
Standard," lEE Communications Magazine, US, ISS Service Center, Piscataway, NJ, Vol. 
34, No. 12, December 1, 1996 (12/1/1996), pages 46-51, describes joining data streams. By 
exploiting data packets of a constant length in the frame structure, the error-resistance is 
increased. Synchronization to the data stream after an error is possible in a simple manner. 
An encapsulation or even the joining of different systems can be carried out in a simple 
manner. 

The publication Information Technology ~ Generic Coding of Audiovisual Objects Part 1 : 
System (Passage) ISO/IEC 14496-1, Final Committee Draft of International Standard, 
May 18, 1998, describes a similar signal joining is known. 

The method according to the present invention is based on two different concepts ~ 
designated below as Concept A and B. In general, each of the concepts is suitable by itself 
alone to secure the desired fimctionality ~ transmission of object-based coded audiovisual 
information ~, but Concept A can be advantageous with regard to larger numbers of objects 
(i.e., a large number of MPEG-4 data streams). A combination of the two concepts is also 
possible. 

Therefore, the method according to the present invention has the great advantage that 
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- all MPEG-4 data streams ~ for example, when a large number of objects are used ~ can 
be packetized using the MPEG-4 FlexMux specification, into one data stream, which 
contains all of the information for decoding (Concept A), and/or 

- a bi-directional communication can be carried out based on the total of MPEG-4 
functionahties, without requiring expensive additional adjustments of the MPEG-4 data to 
the formats of the communications standard. This is made possible through the consistent 
exploitation of the mechanisms made available by the multimedia commimications H.324 
standard (Concept B). 

Furthermore, in the exchange of capabilities and in the opening of a transmission channel, 
the same data structures are used which designate the type of data stream to be transmitted, 
the coding tools used, and their parameters, such as the data capacity. 

Through the application of data packets of a constant length (in Concept A) or through the 
exploitation of the firame structure of the multiplex H.223 standard embedded in H.324 (in 
Concept B), error-resistance is increased. Synchronization to the data stream after an error is 
possible in a simple manner. An encapsulation or even the joining of different systems, e.g., 
a combination of an H.324 platform and an MPEG-4 platform, can be carried out in a simple 
manner. 

Brief Description Of The Drawings 

Figure 1 depicts a block diagram of a multimedia system according to the H.324 standard. 

Figure 2a depicts a first block diagram of MPEG-4 multimedia systems based on an H.324 
terminal. 

Figure 2b depicts a second block diagram of MPEG-4 multimedia systems based on an 
H.324 terminal. 

Figure 3 depicts the setup of a Flex-Mux protocol in simple mode having a constant length. 



Figure 4 depicts the setup of a Flex-Mux protocol in Mux mode having a constant length. 
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Figure 5 dq)icts an Adaptation Layer Frame in accordance with ITU H.223. 



Figure 6 depicts the nesting of the data of the logical ITU channels. 
Figiare 7 depicts the header format. 

Figure 8 depicts an example for a Multiplex Entry Descriptor. 

Figure 9 depicts the integrating of packets of a constant length into the ITU Adaptation Layer 
of variable length. 

Detailed Description 

Before the method according to the present invention is described in detail, for the sake of 
greater intelligibility, the standards used will be briefly specified: 
The ITU-H.324 standard specifies a terminal which is composed of a video codec in 
accordance with H.261/H.263, an audio codec in accordance with G.723, a multiplexer in 
accordance with H.223, and a control protocol in accordance with H.245. The setup and the 
assembly of the individual components is described in this standard. 

The ITU-H.223 standard specifies a packet-oriented multiplex protocol for multimedia 
communications at low bitrates. It is used for the transmission of low bitrates between two 
multimedia terminals or between one terminal and a multipoint unit. The protocol makes 
possible the transmission of any combination of audio, video, and data information via one 
individual communications channel. The protocol is characterized by "low-delay" and low 
overhead. The protocol procedures for implementing the multiplex protocol are specified in 
the H.245 standard. 

The ITU-H. 245 standard, "Control Protocol for Multimedia Communication," specifies the 
syntax and the semantics of terminal information and messages as well as the procedures for 
the communications setup. The messages make possible the exchange of terminal 
capacities/capabilities, e.g., terminal A signals to terminal B that it can decode video data and 
which methods it supports. 
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Furthermore, a protocol is specified, permitting the rehable transmission of audiovisual data 
via an Acknowledge Message (terminal A signals to terminal B the correct reception of the 
data packet). 

The ITU-H.263/H.261 specifies the coding of compressed video data for channels at low 
bitrates. 

The G.723.1 standard specifies the decoding of compressed audio data for channels of low 
bitrates. 

For the transmission of MPEG-4 data using the H.245 standard, the following steps are 
required: 

1 . First, a capability exchange of the communicating terminals takes place in order to 
make possible reciprocal communication. The data transmission takes place in logical 
channel 0, corresponding to H.245, provided for this purpose. 

2. Next, the MPEG-4 decoders are configured. The specific MPEG-4 information 
required for this purpose, such as the hiitial Object Descriptor, is transmitted either 
via H.245, in particular logical channel 0, or via a separate logical ITU channel, in 
particular a logic channel not equal to 0, in accordance with the ITU-H.223 standard. 

3. Then, using the H.245 standard, the individual logical channels are opened for 
transmitting the audiovisual data streams. 

Regarding 1 : Capability Exchange 

For the capabihty exchange, it is sufficient to define an MPEG-4 capabihty within H.245, 
which can look as follows: 

Isl4496Capability 
{ 



streamType 



INTEGER (0..255) 



Profilehidication 



Levellndication 



INTEGER (0..255) 
INTEGER (0..255) 
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} 



or 



Isl4496Capability 

{ 

streamType 
DecoderSpecificInfo 



INTEGER (0..255) 



OCTET STRING OPTIONAL 



} 



or 



Isl4496Capability 

{ 



decConfDescr 



DecoderConfigDescriptor 



} 



The individual fields of the above data structures are discussed in greater detail in the 
MPEG-4 documents (ISO/IEC 14496). The advantage of this Capability Definition is based 
on the low data overhead and a reference to the specification within the MPEG-4 standard, 
therefore avoiding an overhead in additional definitions in the H.245 standard. The 
StreamType defines the type (i.e., the content) of the data stream, the Profile Indicator 
defines the decoder tools, and the level defines the parameters of these decoder tools. Among 
other things, these parameters are contained within MPEG-4, with the exception of the Level 
Indication, which has yet to be specified by MPEG. 

In Concept B, using the "data type" field when a logical channel is opened using the H.245 
function, OpenLogicalChannel, the isl4496Capability also functions to indicate the MPEG-4 
data type transmitted in this channel. 

Regarding 2: Configuration of the Decoders 

After the terminal capabilities have been defined using the Capability Exchange, the 
configuration of the decoders is carried out through the transmission of the Initial Object 
Descriptors or of the Object Descriptors. This takes place either using a request/confirm 
command in accordance with H.245, within which the Initial Object Descriptors are 
exchanged, or by opening a new logical ITU channel, which only contains the Initial Object 
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Descriptor or the SL-packetized Object Descriptor stream. 

Regarding 3: Opening the Logical Channels and Data Transmission 

After the configuration, the individual ITU channels are opened, lii general, the following 
applies: 

The audiovisual coded information, in particular in accordance with MPEG-4, is processed in 
separate data streams. An encoder, which generates an MPEG-4 conformal data stream, 
already delivers at its output a plurality of these separate data streams, in particular SL 
(Synchronization Layer)-packetized data streams. In Figure 2a and Figure 2b, the elementary 
data streams (El. Streams) are depicted at the "Elementary Stream Interface" of the Sync 
(synchronization) layer. In this regard, it should be noted that the header of the SL packets 
can also be configured at "zero" ~ i.e., omitted. Within this "Sync Layer," the packetizing of 
the elementary data streams takes place, which then can be picked off at the "Stream 
Multiplex Interface" for ftorther processing. 

According to Concept B, opening a logical channel takes place using the 
OpenLogicalChannel Message defined in H.245. In opening the specific logical channel, the 
"portNumber" field fimctions for the signaling of the assigned elementary data stream 
identification (ES_ID), using which the data streams are referenced on the MPEG-4 side. 
Using the "streamType" field, to which is assigned here the value of an Isl4496CapabiHty 
(thus the same data structures can be used as in the case of the Capability Exchange), in this 
context, the content of a logical channel (i.e., the MPEG-4 object type) is explicitly indicated 
in each case. In the actual ~ then subsequent ~ data transmission, in Concept B, each 
individual SL-packetized MPEG-4 data stream is picked off at the "Stream Multiplex 
Interface" and is transmitted in a logical ITU channel. For this purpose, the SL-packetized 
MPEG-4 data streams are fiirther processed by the H.233 AdaptationLayer as AL-SDU 
packets and are multiplexed using the H.223 standard (exemplary embodiment according to 
Figure 2a). This acceptance of the MPEG-4 firaming of the data into a fi-aming in accordance 
with H.223 (SL-PDU: = AL-SDU) increases the error-resistance and makes possible a simple 
resynchronization in the event that a packet was transmitted in an erroneous fashion. In 
addition, as a result, an otherwise additional adaptation of the MPEG-4 data format to the 
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format of the multiplexer is avoided. Concept B makes possible the (later) dynamic adding of 
further MPEG-4 data streams. 

For the conversion of Concept A, individual data streams are multiplexed into a total of only 
one data stream using the MPEG-4 FlexMux and are transmitted in a total of one logical ITU 
channel (exemplary embodiment in accordance with Figure 2b). For this type of transmission 
of MPEG-4 data streams using the FlexMux, additional Descriptors are defined, which make 
possible the connection setup. Only using these Descriptors is the recognition of the 
individual MPEG-4 data streams possible. These MPEG-4-specific data streams are 
multiplexed using the MPEG-4 FlexMux tools. In this context, the use of packets of a 
constant length is defined, as a result of which the error-resistance is increased. In this 
manner, synchronization to the data stream after an error is possible. 

Below, Concept A is described in detail. 

As Figure 2b shows, the following logical MPEG-4 objects (SL-packetized data streams) can 
be multiplexed using the MPEG-4 FlexMux tools into one transmission firame and can be 
transmitted in one logical ITU channel ALl: 
SL-audio, 
SL- video, 

SL-OCR (Object Clock Reference), 

SL-OD (Object Descriptor), 

SL-OCI (Object Content Information). 

In one easy transformation of Concept A, it is also possible to multiplex data exclusively of 
the same type (e.g., either only SL-audio or only SL-video) into one logical channel using 
the FlexMux tools, i.e., to transmit the entirety of the MPEG-4 data streams in a plurality 
(although less than in Concept B) of logical ITU channels. Under certain circumstances, this 
would make possible a simpler separation and decoding of the multiplexed data in the 
receiver. However, the original Concept A, i.e., the multiplexing of all MPEG data streams 
into one logical ITU channel using the FlexMux tools is considered below. 



Concept A O'ust as Concept B) makes possible the transmission of a plurality of MPEG-4 
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data streams of the same type, such as the transmission of apluraUty of audio streams for an 
image-accompanying sound in different languages. 

For the method in accordance with Concept A, MUXCODETABLE_Entry is transmitted 
during the initiahzation phase in order to configure the MPEG-4 FlexMux. 

In the end, the stipulated assignment of the individual ES streams to the data to be 
multiplexed is communicated to the MPEG-4 decoder. This is achieved using a Channel Map 
Table (also known as Stream Map Table). 

In addition to the Object Descriptors, these two information messages are used for the 
decoding. 

In order to insert the additional information messages MUXCODETABLE Entry and 
Channel Map Table into the Initial Object Descriptor, the definition of the new Descriptors is 
used. The latter are inserted in the form of Extension Descriptors into the Initial Object 
Descriptor. 

Class Channel Map Table Descriptor: bit (8) tag = to be defined 
{ 

bit(16) length: 

bit (15) streamCount: 

bit (1) MultiplexCodeFlag; 

For (i=0; i<streamCount; i++{ 

bit (16) ESJD; 

bit (8) FlexMuxChannel; 

IFMultiplexCodeFlag { 

bit (4) MultiplexCode; 
bit (4) reserved; 

} 

} 
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The part printed in bold indicates the Descriptor that is newly defined here. 



Similarly, the setup of a MuxCodeTableEntryDescriptor can be carried out: 



Class MuxCodeTableEntryDescriptor: bit (8) tag = to be defined 
{ 

bit (16) length 

bit (4) number of MuxCodeTableEntries; 

bit (1) constantLengthFlag; 

bit (3) reserved; 

IF constantLengthFlag 

bit(8) FlexMuxLength; 
For (j=0; j<numberOfIVIuxCodeTableEntries; 



bit (8) length; 

bit (4) MuxCode; 

bit (4) version; 

bit (8) substructureCount; 

for (1=0; KsubstructureCount; I++) ( 

bit (5) slotCount; 

bit (3) repetitionCount; 

for (k=0; k<slotCount; k++) ( 

bit (8) FlexMuxChannel (I) (k); 
bit (8) numberOfBytes (I) (k); 

} 

} 



The part printed in bold indicates the Descriptor newly defined here. The data field 
numberOfMuxCodeTableEntries makes it possible to transmit the maximum of 16 
MuxCodeTableEntries. Using the constantLengthFlag and the field FlexMuxLength, it is 
signaled to the receiver that the FlexMux packets having a constant length are transmitted in 
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the packet size FlexMuxLength + 2. 

The FlexMux packets defined in MPEG-4 are transmitted, on the one hand, in the simple 
mode in accordance with Figure 3 and, on the other hand, in the MuxCode in accordance 
with Figure 4. 

By using packets of a constant, nonlinear length, here 127 bytes, the upper 7 bits of the 
Length field can be used for synchronization. 

This increases the error-resistance and makes possible a resynchronization in the event that a 
Length field of one packet is faulty. 

These FlexMux packets are now merged into one ITU frame. In Figure 5, an Adaptation 
Layer (AL) frame in accordance with ITU-H.223 is depicted, having an AL-PDU (Protocol 
Data Unit) Payload Field. Due to the variable length of a FlexMux packet, it would no longer 
be possible to locate a new FlexMux packet after an error in the length field. This is 
particularly injurious if a plurality of MPEG-4 elementary streams (e.g., BIFS, OD, and 
video) are transmitted in one ITU channel. 

By using constant lengths within the MPEG-4 FlexMux packets, according to the present 
invention this is now possible. 

By using a constant length within the MPEG-4 FlexMux packet, this is now possible 
according to the present invention. 

The individual AL-PDU packets of variable lengths are now packetized using the 
multiplexer. 

The setup of the Multiplexed Layer and the basic integrating of the MPEG-4 FlexMux data 
stream are briefly discussed. 

A MUX Protocol Data Unit (MUX-PDU) is composed of a header and an information field, 
in nesting the data of the individual logical ITU channels. Figure 6 depicts the setup. 

The header is composed of individual fields, which are shown in Figure 1 . 



The 4-bit-large Multiplex Code indicates a MultiplexEntry transmitted over H.245, of which 
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a maximum of 1 5 different ones can be defined. 



The header Error Control Field is a 3-bit-large CRC Field, which permits error detection in 
the header. 

The 1-bit packet Marker Field marks the end of a MUX-SDU of a segmented logical 
channel. 

The information field shown in Figure 6 is configured using the MultiplexTable transmitted 
in H.245. 

The information field can at any time be closed at an Octet Border using a Closing Flag, but 
a MUX-SDU may not be interrupted by a non-segmentable channel. 

The MultiplexEntryDescriptor configures the H.223 Multiplexer and is transmitted in the 
initialization phase (Figure 8). 

In this Figure, LCN denotes: LogicalChanneUSfumber; RC: RepeatCoimt; UCF: 
UntilClosingFlag. 

The advantage is made clear in Figure 9: 

if, in an ITU channel, a plurality of MPEG data are transmitted and MPEG-4 packets of 
variable length are used, then none of the following FlexMuxPackets can any longer be 
decoded. The skillfiil use of the Length field as a synchronization marker makes possible the 
synchronization of the receiver. 

The transmitting terminal signals the packet length to the receiving terminal using the 
MuxCodeTableEntryDescriptor defined here, which is designated by a flag, which signals the 
use of FlexMux packets of a constant length and which also contains a field that estabUshes 
the length to be used. In this manner, high flexibility, in conjimction with great error- 
resistance, is assured. 



Of course, the present invention does not have to be used only for MPEG-4 data, but may 

also be used for other audiovisual coded information which is to be merged into one 
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standardized transmission frame and whose decoding is to take place in a manner that is 
simple and resistant to errors. 

Of course, the method presented can be realized in transmitter-side and receiver-side 
terminals. For transmitter-side integrating, the appropriate element is provided for preparing, 
or for delivering, audiovisual coded information, as well as an appropriate element for 
multiplexing the data streams, exchanging the capabilities, and signaling. For the receiver- 
side evaluation, what is desirable is a way for decomposing up the multiplexed data channels 
as well as means for exchanging capabilities and their valuation, as well as for evaluating the 
signaling. Since usually work is done in the interactive mode, subscriber terminals are 
furnished both for transmitting as well as for receiving operation. 
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Abstract Of The Disclosure 



To integrate audiovisual, encoded information into one predefined, frame-structured 
transmission standard, individual data streams are multiplexed into one or a plurality of data 
channels of the frame-structured transmission standard. In addition, the capabilities of the 
communicating terminals are exchanged. 
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[10191/1753] 

A METHOD FOR INTEGRATING AUDIOVISUAL CODED INFORMATION INTO 
ONE PREDEFINED, FRAME-STRUCTURED TRANSMISSION STANDARD, AS WELL 
AS TERMINALS FOR THIS PURPOSE 

Background Information 

For the transmission of image and soimd data at low bitrates for multimedia cormnunications, 
on the basis of the ITU-H.324 specification, "Terminals for Low-Bitrate Multimedia 
Communications," a system is specified which is suited for video telephony applications. 



Figure 1 depicts a block diagram of a multimedia system of this type according to the H.324 
standard. In the block designated by reference nimieral 1 are accommodated the modules 
which are defined in greater detail in H.324. Video codec 2 is configured in accordance with 

10 the method in ITU-H.263/H.261 . In order to smooth out any time differences between the 
image coding and the sound coding, a delay device 4 is connected downstream of audio 
codec 3 in accordance with ITU G.723. Device 5 fimctions to process data protocols, e.g., 
V.14 LAPM, etc., and device 6 processes control protocols in accordance with ITU H.245. 
The audiovisual data is supplied to codecs 2 and 3 by appropriate I/O (Input/Output) devices 

15 7 and 8. The devices for processing protocols 5 and 6 receive their input data via devices 9 
(User Data Apphcations) and 10 (System Control). The data streams of codecs 2, 3 and of 
protocol processing devices 5 and 6 are joined via multiplex/demultiplex device 1 1 in 
accordance with the H.223 standard. Modem 12, connected downstream, supphes V.34 
conformal data for the combined data streams and V.25 conformal data for the system 

20 control data. Transmission network 13 is connected to block 1, along with an appropriate 
network control system 14. 



Advantages of the Invention 



25 The method in accordance with the features of Claim 1 and the subclaims is suitable for 

integrating information that is coded in an object-based manner, in particular according to the 
MPEG-4 transmission standard, into one predefined, frame-structured transmission standard, 
in particular into an ITU standard, and it thus makes possible the transfer of the coded 
MPEG-4 data. In contrast to conventional video coding methods, such as the video method 



discussed above in accordance with ITU-H.263/H.261 and the audio codec according to 
G.723.1, the following specific advantages result: 

- obj ect-based coding of synthetic and natural visual obj ects as well as audio obj ects, 

- improved coding efficiency, 

- improved visual error-resistance of the video coding, 

- individualized format for describing the arrangement of audiovisual objects, 

- synchronization of different audiovisual objects, 

- interaction with audiovisual objects. 

The method accordmg to the present invention is based on two different concepts - 
designated below as Concept A and B. In general, each of the concepts is suitable by itself 
alone to secure the desired fimctionality ~ transmission of object-based coded audiovisual 
information ~, but Concept A can be advantageous with regard to larger numbers of objects 
(i.e., a large number of MPEG-4 data streams). A combination of the two concepts is also 
possible. 

Therefore, the method according to the present invention has the great advantage that 

- all MPEG-4 data streams ~ for example, when a large number of objects are used ~ can 
be packetized using the MPEG-4 FlexMux specification, into one data stream, which 
contains all of the information for decoding (Concept A), and/ or 

- a bi-directional communication can be carried out based on the total of MPEG-4 
fimctionalities, without requiring expensive additional adjustments of the MPEG-4 data to 
the formats of the communications standard. This is made possible through the consistent 
exploitation of the mechanisms made available by the multimedia commimications H.324 
standard (Concept B). 

Furthermore, in the exchange of capabilities and in the opening of a transmission channel, 
the same data stinctures are used which designate the type of data stream to be transmitted, 
the coding tools used, and their parameters, such as the data capacity. 

Through the application of data packets of a constant length (in Concept A) or through the 
exploitation of the firame stiiicture of the multiplex H.223 standard embedded in H.324 (in 
Concept B), error-resistance is increased. Synchronization to the data stream after an error is 
possible in a simple manner. An encapsulation or even the joining of different systems, e.g., 
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a combination of an H.324 platform and an MPEG-4 platform, can be carried out in a simple 
manner. 

Drawings 

The present invention is discussed in greater detail on the basis of the further drawings. The 
following are the contents: 

- Figiores 2a and 2b depict block diagrams of MPEG-4 multimedia systems based on an 
H.324 terminal, 

- Figure 3 depicts the setup of a Flex-Mux protocol in simple mode having a constant length, 

- Figure 4 depicts the setup of a Flex-Mux protocol in Mux mode having a constant length, 

- Figure 5 depicts an Adaptation Layer Frame in accordance with ITU H.223, 

- Figure 6 depicts the nesting of the data of the logical ITU channels, 

- Figure 7 depicts the header format, 

- Figure 8 depicts an example for a Multiplex Entry Descriptor, 

- Figure 9 depicts the integrating of packets of a constant length into the ITU Adaptation 
Layer of variable length. 

Description of the Exemplary Embodiments 

Before the method according to the present invention is described in detail, for the sake of 
greater intelligibility, the standards used will be briefly specified: 
The ITU-H.324 standard specifies a terminal which is composed of a video codec in 
accordance with H.261/H.263, an audio codec in accordance with G.723, a multiplexer in 
accordance with H.223, and a control protocol in accordance with H.245. The setup and the 
assembly of the individual components is described in this standard. 

The ITU-H.223 standard specifies a packet-oriented multiplex protocol for multimedia 
communications at low bitrates. It is used for the transmission of low bitrates between two 
multimedia termmals or between one terminal and a multipoint xmit. The protocol makes 
possible the transmission of any combination of audio, video, and data information via one 
individual communications channel. The protocol is characterized by "low-delay" and low 
overhead. The necessary protocol procedures for implementing the multiplex protocol are 
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specified in the H.245 standard. 

The ITU-H. 245 standard, "Control Protocol for Multimedia Communication," specifies the 
syntax and the semantics of terminal information and messages as well as the procedures for 
the communications setup. The messages make possible the exchange of terminal 
capacities/capabilities, e.g., terminal A signals to terminal B that it can decode video data and 
which methods it supports. 

Furthermore, a protocol is specified, permitting the reliable transmission of audiovisual data 
via an Acknowledge Message (terminal A signals to terminal B the correct reception of the 
data packet). 

The ITU-H.263/H.261 specifies the coding of compressed video data for channels at low 
bitrates. 

The G.723.1 standard specifies the decoding of compressed audio data for channels of low 
bitrates. 

For the transmission of MPEG-4 data using the H.245 standard, the following steps are 
required: 

1 . First, a capability exchange of the communicating terminals must take place in order 
to make possible reciprocal commimication. The data transmission takes place in 
logical channel 0, corresponding to H.245, provided for this purpose. 

2. Next, it is necessary to configure the MPEG-4 decoders. The specific MPEG-4 
information required for this purpose, such as the Initial Object Descriptor, is 
transmitted either via H.245, in particular logical channel 0, or via a separate logical 
ITU channel, in particular a logic channel not equal to 0, in accordance with the ITU- 
H.223 standard. 

3. Then, using the H.245 standard, the individual logical channels are opened for 
transmitting the audiovisual data streams. 

Regarding 1: Capability Exchange 
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For the capability exchange, it is sufficient to define an MPEG-4 capability within H.245, 
which can look as follows: 

Isl4496Capability 
{ 

streamType 
Profilelndication 
Levellndication 

} 

or 

Isl4496Capability 
{ 

StreamType INTEGER (0..255) 

DecoderSpecificInfo OCTET STRING OPTIONAL 

} 

or 

Isl4496CapabiUty 
{ 

decConfDescr DecoderConfigDescriptor 

} 

The individual fields of the above data structures are discussed in greater detail in the 
MPEG-4 documents (ISO/IEC 14496). The advantage of this Capability Definition is based 
on the low data overhead and a reference to the specification within the MPEG-4 standard, 
therefore avoiding an overhead in additional definitions in the H.245 standard. The 
StreamType defines the type (i.e., the content) of the data stream, the Profile Indicator 
defines the decoder tools, and the level defines the parameters of these decoder tools. Among 
other things, these parameters are contained within MPEG-4, with the exception of the Level 
Indication, which has yet to be specified by MPEG. 

In Concept B, using the "data type" field when a logical channel is opened using the H.245 
fimction, OpenLogicalChannel, the isl4496Capability also fimctions to indicate the MPEG-4 
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INTEGER (0..255) 
INTEGER (0..255) 
E^TEGER (0..255) 



data type transmitted in this channel. 



Regarding 2: Configuration of the Decoders 

After the terminal capabilities have been defined using the Capability Exchange, the 
configuration of the decoders is carried out through the transmission of the Initial Object 
Descriptors or of the Object Descriptors. This takes place either using a request/confirm 
command in accordance with H.245, within which the Initial Object Descriptors are 
exchanged, or by opening a new logical ITU channel, which only contains the Initial Object 
Descriptor or the SL-packetized Object Descriptor stream. 

Regarding 3: Opening the Logical Channels and Data Transmission 

After the configuration, the individual ITU channels are opened. In general, the following 
applies: 

The audiovisual coded information, in particular in accordance with MPEG-4, is processed in 
separate data streams. An encoder, which generates an MPEG-4 conformal data stream, 
already dehvers at its output a plurality of these separate data streams, in particular SL 
(Synchronization Layer)-packetized data streams. In Figure 2a and Figure 2b, the elementary 
data streams (El. Streams) are depicted at the "Elementary Stream Interface" of the Sync 
(synchronization) layer. In this regard, it should be noted that the header of the SL packets 
can also be configured at "zero" ~ i.e., omitted. Within this "Sync Layer," the packetizing of 
the elementary data streams takes place, which then can be picked off at the "Stream 
Multiplex Interface" for finther processing. 

According to Concept B, opening a logical channel takes place using the 
OpenLogicalChannel Message defined in H.245. In opening the specific logical channel, the 
"portNumber" field fiinctions for the signaling of the assigned elementary data stream 
identification (ES_ID), using which the data streams are referenced on the MPEG-4 side. 
Using the "streamType" field, to which is assigned here the value of an Isl4496CapabiHty 
(thus the same data structures can be used as in the case of the Capability Exchange), in this 
context, the content of a logical channel (i.e., the MPEG-4 object type) is explicitly indicated 

NY01 355243 v 1 6 



in each case. In the actual ~ then subsequent — data transmission, in Concept B, each 
individual SL-packetized MPEG-4 data stream is picked off at the "Stream Multiplex 
Interface" and is transmitted in a logical ITU chaimel. For this purpose, the SL-packetized 
MPEG-4 data streams are further processed by the H.233 AdaptationLayer as AL-SDU 
packets and are multiplexed using the H.223 standard (exemplary embodiment according to 
Figure 2a). This acceptance of the MPEG-4 framing of the data into a framing in accordance 
with H.223 (SL-PDU: = AL-SDU) increases the error-resistance and makes possible a simple 
resynchronization in the event that a packet was fransmitted in an erroneous fashion. In 
addition, as a result, an otherwise additionally necessary adaptation of the MPEG-4 data 
format to the format of the multiplexer is avoided. Concept B makes possible the (later) 
dynamic adding of ftirther MPEG-4 data streams. 

For the conversion of Concept A, individual data streams are multiplexed into a total of only 
one data stream using the MPEG-4 FlexMux and are transmitted in a total of one logical ITU 
channel (exemplary embodiment in accordance with Figure 2b). For this type of transmission 
of MPEG-4 data streams using the FlexMux, additional Descriptors are defined, which make 
possible the connection setup. Only using these Descriptors is the recognition of the 
individual MPEG-4 data streams possible. These MPEG-4-specific data streams are 
multiplexed using the MPEG-4 FlexMux tools. In this context, the use of packets of a 
constant length is defined, as a result of which the error-resistance is increased. In this 
manner, synchronization to the data sfream after an error is possible. 

Below, Concept A is described in detail. 

As Figure 2b shows, the following logical MPEG-4 objects (SL-packetized data sfreams) can 

be multiplexed using the MPEG-4 FlexMux tools into one transmission frame and can be 

transmitted in one logical ITU channel ALl : 

SL-audio, 

SL-video, 

SL-OCR (Object Clock Reference), 

SL-OD (Object Descriptor), 

SL-OCI (Object Content Information). 
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In one easy transformation of Concept A, it is also possible to multiplex data exclusively of 
the same type (e.g., either only SL-audio or only SL-video) into one logical channel using 
the FlexMux tools, i.e., to transmit the entirety of the MPEG-4 data streams in a plurality 
(although less than in Concept B) of logical ITU channels. Under certain circumstances, this 
would make possible a simpler separation and decoding of the multiplexed data in the 
receiver. However, the original Concept A, i.e., the multiplexing of all MPEG data streams 
into one logical ITU channel using the FlexMux tools is considered below. 

Concept A (just as Concept B) makes possible the transmission of a plurality of MPEG-4 
data streams of the same type, such as the transmission of a plurality of audio streams for an 
image-accompanying sotmd in different languages. 

For the method in accordance with Concept A, it is necessary to transmit 
MUXCODETABLE Entry during the initialization phase in order to configure the MPEG-4 
FlexMux. 

In the end, the stipulated assignment of the individual ES streams to the data to be 
multiplexed must be commimicated to the MPEG-4 decoder. This is achieved using a 
Channel Map Table (also known as Stream Map Table). 

In addition to the Object Descriptors, these two information messages are necessary for the 
decoding. 

In order to insert the additional information messages MUXCODETABLE_Eiitry and 
Channel Map Table into the Initial Object Descriptor, the definition of the new Descriptors is 
necessary. The latter are inserted in the form of Extension Descriptors into the Initial Object 
Descriptor. 

Class Channel Map Table Descriptor: bit (8) tag = to be defined 
{ 

bit(16) length: 

bit (15) streamCount: 

bit (1) MultiplexCodeFlag; 
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For (i=0; i<streamCount; 
bit(16)ES_ID; 
bit (8) FlexMuxChannel; 
IFMultiplexCodeFlag{ 



bit (4) MultiplexCode; 
bit (4) reserved; 



} 

} 

} 

The part printed in bold indicates the Descriptor that is newly defined here. 



Similarly, the setup of a MuxCodeTableEntryDescriptor can be carried out: 

Class MuxCodeTableEntryDescriptor: bit (8) tag = to be defined 
{ 

bit (16) length 

bit (4) number of MuxCodeTableEntries; 

bit (1) constantLengthFlag; 

bit (3) reserved; 

IF constantLengthFlag 

bit(8) FlexMuxLength; 
For 0=0; j<numberOfMuxCodeTableEntries; j++{ 



bit (8) length; 

bit (4) MuxCode; 

bit (4) version; 

bit (8) substructureCount; 

for (1=0; i<substructureCount; 1++) ( 

bit (5) slotCount; 

bit (3) repetitionCoimt; 

for (k=0; k<slotCount; k++) ( 



bit (8) FlexMuxChannel (I) (k); 
bit (8) numberOfBytes (I) (k); 



The part printed in bold indicates the Descriptor newly defined here. The data field 
numberOfMuxCodcTableEntries makes it possible to transmit the maximum of 16 
MuxCodeTableEntries. Using the constantLengthFlag and the field FlexMuxLength, it is 
signaled to the receiver that the FlexMux packets having a constant length are transmitted in 
the packet size FlexMuxLength + 2. 

The FlexMux packets defined in MPEG-4 are transmitted, on the one hand, in the simple 
mode in accordance with Figure 3 and, on the other hand, in the MuxCode in accordance 
with Figure 4. 

By using packets of a constant, nonlinear length, here 127 bytes, the upper 7 bits of the 
Length field can be used for synchronization. 

This increases the error-resistance and makes possible a resynchronization in the event that a 
Length field of one packet is faulty. 

These FlexMux packets must now be merged into one ITU firame. In Figure 5, an Adaptation 
Layer (AL) fi-ame in accordance with ITU-H.223 is depicted, having an AL-PDU (Protocol 
Data Unit) Payload Field. Due to the variable length of a FlexMux packet, it would no longer 
be possible to locate a new FlexMux packet after an error in the length field. This is 
particularly injurious if a plurality of MPEG-4 elementary streams (e.g., BIFS, OD, and 
video) are transmitted in one ITU channel. 

By using constant lengths within the MPEG-4 FlexMux packets, according to the invention 
this is now possible. 

By using a constant length within the MPEG-4 FlexMux packet, this is now possible 
according to the present invention. 

The individual AL-PDU packets of variable lengths are now packetized using the 
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multiplexer. 

The setup of the Multiplexed Layer and the basic integrating of the MPEG-4 FlexMux data 
stream are briefly discussed. 

A MUX Protocol Data Unit (MUX-PDU) is composed of a header and an information field, 
in nesting the data of the individual logical ITU channels. Figure 6 depicts the setup. 

The header is composed of individual fields, which are shown in Figure 1. 

The 4-bit-large Multiplex Code indicates a MultiplexEntry transmitted over H.245, of which 
a maximum of 15 different ones can be defined. 

The header Error Control Field is a 3-bit-large CRC Field, which permits error detection in 
the header. 

The 1-bit packet Marker Field marks the end of a MUX-SDU of a segmented logical 
channel. 

The information field shown in Figure 6 is configured using the MultiplexTable transmitted 
in H.245. 

The information field can at any time be closed at an Octet Border using a Closing Flag, but 
a MUX-SDU may not be interrupted by a non-segmentable channel. 

The MultiplexEntryDescriptor configures the H.223 Multiplexer and is transmitted in the 
initialization phase (Figure 8). 

In this Figure, LCN denotes: LogicalChannelNumber; RC: RepeatCount; UCF: 
UntilClosingFlag. 

The advantage is made clear in Figure 9: 

if, in an ITU channel, a plurality of MPEG data are transmitted and MPEG-4 packets of 
variable length are used, then none of the following FlexMuxPackets can any longer be 
decoded. The skillfiil use of the Length field as a synchronization marker makes possible the 
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synchronization of the receiver. 

The transmitting terminal signals the packet length to the receiving terminal using the 
MuxCodeTableEntryDescriptor defined here, which is designated by a flag, which signals the 
use of FlexMux packets of a constant length and which also contains a field that estabUshes 
the length to be used. In this manner, high flexibility, in conjunction with great error- 
resistance, is assured. 

Of course, the present invention does not have to be used only for MPEG-4 data, but may 
also be used for other audiovisual coded information which is to be merged into one 
standardized transmission fi-ame and whose decoding is to take place in a manner that is 
simple and resistant to errors. 

Of course, the method presented can be realized in transmitter-side and receiver-side 
terminals. For transmitter-side integrating, the appropriate means must be provided for 
preparing, or for delivering, audiovisual coded information, as well as appropriate means for 
multiplexing the data streams, exchanging the capabilities, and signaling. For the receiver- 
side evaluation, means are necessary for decomposing up the multiplexed data channels as 
well as means for exchanging capabilities and their valuation, as well as for evaluating the 
signaling. Since usually work is done in the interactive mode, subscriber terminals are 
fiunished both for transmitting as well as for receiving operation. 
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What is claimed is: 



1. A method for integrating audiovisual, coded information into one predefined, frame- 
structured transmission standard, comprising the following steps: 

the audiovisual coded information is preprocessed into separate data streams, 
respectively or is suppHed in the form of separate data streams; 
the individual data streams are multiplexed into one or a plurality of data channels in 
the frame-structured transmission standard; 

capabilities, in particular the encoding and decoding capabihties, of the terminals that 
communicate with one another are exchanged after a connection is estabUshed; 
for signaling, data structures of and encoding standard are used which contain 
specifications on the data type used, the decoding tool to be used, and the encoding 
parameters, such as the data capacity. 

2. The method as recited in Claim 1, 

characterized in that the data structures that are used in exchanging capabilities and in 
opening a transmission channel are selected so as to be identical. 

3. The method as recited in Claim 1 or 2, 

characterized in that, by way of example, for object-based, audio visually encoded 
information having a low number of objects, the appropriate data streams are packetized 
using a multiplexer, in particular a multiplexer in accordance with the H.223 standard, and 
are transmitted in individual transmission channels, in particular ITU channels, in accordance 
with the H.245 standard. 

4. The method as recited in Claim 3, 

characterized in that a data field ("portNumber" - Field) is used to assign the individual 
packets within the transmission channels assigned to them, the data field in an MPEG-4 
transmission containing an identification of the individual elementary data streams (ES_ID). 

5. The method as recited in Claim 3, 

characterized in that the data format of the MPEG-4 output data streams (SL-PDUs) is taken 
over directly from the multiplexing process, so that no fijrther reformatting is necessary. 
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6. The method as recited in Claim 3, 

characterized in that the object descriptors, in particular the "Initial Object Descriptor" in 
accordance with MPEG-4, are accommodated for audiovisual coded information in a 
separate channel, in particular in a logical channel not equal to 0 in accordance with the ITU- 
H.223 standard. 

7. The method as recited in Claim 3, 

characterized in that, for object-based audiovisual, coded information ~ for example, when 
the number of objects is large ~, the corresponding data streams are multiplexed into one 
common data stream and are transmitted in one transmission channel, in particular in an ITU 
channel. 

8. The method as recited in one of Claims 1, 2, or 7, 

characterized in that, in addition to the multiplexed data streams in the transmission frame of 
the frame-structured transmission standard, a signaling information is stored, which makes 
reference to the fact that multiplexed information packets of a constant length are 
transmitted, on whose basis a synchronization can be cmied out, in particular in response to 
faulty data packets. 

9. The method as recited in one of Claims 1, 2, or 7, 

characterized in that object descriptors, in particular the "Initial Object Descriptor" in 
accordance with MPEG-4 for audiovisual coded information, are accommodated in an 
additional channel, in particular in logical channel 0 corresponding to the ITU H.245 
standard. 

10. The method as recited in one of Claims 1 , 2, 7, or 8, 

characterized in that allocation data between the separate data sfreams, in particular SL- 
packetized MPEG-4 elementary data streams and the multiplexed data in the additional 
channel, in particular in the logical channel 0 in accordance with the ITU-H.245 standard, are 
stored. 

1 1 . The method as recited in one of Claims 1, 2, 7, 8, or 9, 

characterized in that, for the signaling information, data fields are provided, which identify, 
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on the one hand, the constant length and, on the other hand, the packet size of the 
multiplexed information packets. 

12. The method as recited in one of Claims 1, 2, or 7 - 10, 

characterized in that, as audiovisual coded information, MPEG-4 data are used, which are 
preprocessed into FlexMux packets of a constant length, and that these FlexMux packets of a 
constant length are multiplexed into a transmission frame, making possible a transmission m 
accordance with the ITU standard H.324. 

13. The method as recited in one of Claims 1 through 12, 

characterized in that, within the "Adaptation Layer" of variable length in accordance with the 
ITU standard H.324, a plurality of MPEG-4 data are stored in data packets of a constant 
length, a length field being provided, in each case, in the header portion of these data 
packets, the length field being able to be used as synchronization identification, in particular 
for synchronizing a receiver. 

14. A terminal for the transmitter-side integrating of audiovisual, encoded information into 
one predefined frame-structured transmission standard, comprising the following features: 

means for preprocessing the audiovisual coded information into separate data streams 
for the terminal, respectively for delivering in the form of separate data streams to the 
terminal; 

means for multiplexing the individual data streams into one or a pluraHty of data 

channels of the frame-structured transmission standard; 

means for exchanging capabilities, in particular the encoding and decoding 

capabiUties, with other terminals, in particular after a connection is established; 

means for signaling, with the us of data structures, the specifications on the data type 

used, the decoding tool to be used, and the encoding parameters, such as the data 

capacity. 

15. A terminal for the receiver-side evaluation of audiovisual encoded information in a 
predefined frame-structured transmission standard, comprising the following features: 

a means for decomposing up one or a plurality of multiplexed frame-structured data 
channels of one transmission standard into individual audiovisual data streams, 
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a means for exchanging capabilities, in particular encoding and decoding capabilities, 
with other terminals, in particular after a connection is estabhshed; 
means for signaling, with the use of data structures, the specifications on the data type 
used, the decoding tool to be used, and the data capacity. 
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Abstract 



To integrate audiovisual, encoded information into one predefined, frame-structured 
transmission standard, individual data streams are multiplexed into one or a pliu-ality of data 
channels of the frame-structured transmission standard. In addition, the capabilities of the 
commimicating terminals are exchanged. 

(Figure 2a) 
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DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to 

my name. 

I believe I am an original, first and joint inventor of the subject matter which is 
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INTEGRATING AUDIOVISUAL CODED INFORMATION INTO ONE 
PREDEFINED, FRAME-STRUCTURED TRANSMISSION STANDARD, AS WELL 
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119 of any foreign application(s) for patent or inventor's certificate hsted below and have also 
identified below any foreign appUcation(s) for patent or inventor's certificate having a filing 
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Please direct all telephone calls to Richard L. Mayer at (212) 425-7200. 
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and that all statements made on information and behef are beheved to be true; and further that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful and false statements may jeopardize the vahdity of 
the apphcation or any patent issued thereon. 
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